GENERALIZED GALOIS NUMBERS, INVERSIONS,
LATTICE PATHS, FERRERS DIAGRAMS AND LIMIT
THEOREMS

SVANTE JANSON

Abstract. Bliem and Kousidis recently considered a family of random variables whose distributions are given by the generalized Galois numbers (after normalization). We give probabilistic interpretations of these random variables, using inversions in random words, random lattice paths and random Ferrers diagrams, and use these to give new proofs of limit theorems as well as some further limit results.

1. Introduction

The homogeneous multivariate Rogers-Szegő polynomial in $m \ge 2$ variables is defined by

$$\tilde H_n(t_1,\dots,t_m) := \sum_{k_1+\dots+k_m=n} \binom{n}{k_1,\dots,k_m}_q\, t_1^{k_1}\cdots t_m^{k_m}, \tag{1.1}$$

where $\binom{n}{k_1,\dots,k_m}_q$ is the q-multinomial coefficient (or Gaussian multinomial coefficient)

$$\binom{n}{k_1,\dots,k_m}_q := \frac{[n]!_q}{[k_1]!_q\cdots[k_m]!_q}, \qquad \text{for } n = k_1+\dots+k_m, \tag{1.2}$$

where $[k]!_q := [1]_q[2]_q\cdots[k]_q$ with $[\ell]_q := (1-q^\ell)/(1-q)$. Equivalently, one might consider the inhomogeneous multivariate Rogers-Szegő polynomial

$$H_n(t_1,\dots,t_{m-1}) := \tilde H_n(t_1,\dots,t_{m-1},1). \tag{1.3}$$

For these polynomials, see Rogers [13], Andrews [1] and Vinroot [17]. We concentrate here on the special value

$$G_n^{(m)}(q) = H_n(1,\dots,1) = \tilde H_n(1,\dots,1) = \sum_{k_1+\dots+k_m=n} \binom{n}{k_1,\dots,k_m}_q, \tag{1.4}$$

studied in Vinroot [17] and Bliem and Kousidis [2]. This is a polynomial in $q$. In the special case $m = 2$, studied in e.g. Goldman and Rota [4], Nijenhuis, Solow and Wilf [11], Kac and Cheung [10, Chapter 7] and Hitzemann and Hochstättler [6], these numbers $G_n(q)$ are known as Galois numbers, and the numbers $G_n^{(m)}(q)$ are therefore called generalized Galois numbers by [17] and [2]. Note that

$$G_n^{(m)}(1) = \sum_{k_1+\dots+k_m=n} \binom{n}{k_1,\dots,k_m} = m^n, \tag{1.5}$$

by the multinomial theorem.

Bliem and Kousidis [2] noted that the polynomial $G_n^{(m)}(q)$ has non-negative coefficients, and thus

$$g_n^{(m)}(q) := \frac{G_n^{(m)}(q)}{G_n^{(m)}(1)} = m^{-n} G_n^{(m)}(q) \tag{1.6}$$

can be interpreted as the probability generating function of a random variable $G_{n,m}$. We let $\mathcal{G}_{n,m}$ denote the probability distribution with the probability generating function (1.6), and have thus $G_{n,m} \sim \mathcal{G}_{n,m}$. (We use, following [2], $G_{n,m}$ for an arbitrary random variable with this distribution. In the next sections we will construct specific random variables of this type.)
The purpose of the present paper is to provide some probabilistic interpretations of this random variable, see Sections 2-4, and to use these interpretations to give new, and perhaps simpler, proofs of the following results in [2]. We use $\xrightarrow{d}$ for convergence in distribution and (later) $\overset{d}{=}$ for equality in distribution. $N(\mu,\sigma^2)$ is the normal distribution with mean $\mu$ and variance $\sigma^2$.

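Theorem 1.1 ([2]). The random variable $G_{n,m}$ has mean and variance

$$\mathbb{E}\, G_{n,m} = \frac{n(n-1)}{4}\cdot\frac{m-1}{m}, \tag{1.7}$$

$$\operatorname{Var} G_{n,m} = \frac{n(n-1)(2n+5)}{72}\cdot\frac{m^2-1}{m^2}. \tag{1.8}$$

Theorem 1.2 ([2]). If $m \to \infty$ with $n \ge 1$ fixed, then

$$G_{n,m} \xrightarrow{d} G_n, \tag{1.9}$$

where $G_n$ is the number of inversions in a random permutation of $\{1,\dots,n\}$.

Theorem 1.3 ([2]). If $n \to \infty$ with $m \ge 2$ fixed, then

$$\frac{G_{n,m} - \mathbb{E}\, G_{n,m}}{\operatorname{Var}(G_{n,m})^{1/2}} \xrightarrow{d} N(0,1); \tag{1.10}$$

equivalently,

$$\frac{G_{n,m} - \mathbb{E}\, G_{n,m}}{n^{3/2}} \xrightarrow{d} N\Bigl(0, \frac{m^2-1}{36m^2}\Bigr). \tag{1.11}$$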

Furthermore, we can also let both m and n tend to infinity; we show that 
there are no surprises in this case. 
Theorem 1.4. If $m, n \to \infty$, then

$$\frac{G_{n,m} - \mathbb{E}\, G_{n,m}}{\operatorname{Var}(G_{n,m})^{1/2}} \xrightarrow{d} N(0,1); \tag{1.12}$$

equivalently,

$$\frac{G_{n,m} - \mathbb{E}\, G_{n,m}}{n^{3/2}} \xrightarrow{d} N\Bigl(0, \frac{1}{36}\Bigr). \tag{1.13}$$

Moreover, we show a local limit theorem strengthening Theorems 1.3 and 1.4.

Theorem 1.5. If $n \to \infty$, then, with $\mu_{n,m} := \mathbb{E}\, G_{n,m}$ and $\sigma^2_{n,m} := \operatorname{Var} G_{n,m}$ given by Theorem 1.1,

$$\sigma_{n,m}\, \mathbb{P}(G_{n,m} = k) = \frac{1}{\sqrt{2\pi}}\, e^{-(k-\mu_{n,m})^2/(2\sigma^2_{n,m})} + o(1), \tag{1.14}$$

uniformly in all $m \ge 2$ and $k \in \mathbb{Z}$.

Equivalently, we can in (1.14) replace $\mu_{n,m}$ and $\sigma^2_{n,m}$ by the approximations $\tilde\mu_{n,m} := \frac{m-1}{4m} n^2$ and $\tilde\sigma^2_{n,m} := \frac{m^2-1}{36m^2} n^3$.
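As a numerical illustration of (1.14), the following minimal sketch computes the exact distribution of $G_{n,m}$ for moderate $n$ and measures the largest deviation from the Gaussian term. It is not part of the argument. The function names `galois_pmf` and `local_limit_error` are ours, and the recursion $G_s^{(j+1)}(q) = \sum_k \binom{s}{k}_q G_{s-k}^{(j)}(q)$ used in the code is an assumption of the sketch; it follows from (1.2) by splitting off the first part of the q-multinomial coefficient.

```python
import math

def galois_pmf(n, m):
    """Return [P(G_{n,m} = k) for k = 0..n(n-1)/2], assuming m >= 2."""
    # binom[s][k]: coefficient list (in q) of the Gaussian binomial [s choose k]_q,
    # built from the rule [s,k]_q = [s-1,k-1]_q + q^k [s-1,k]_q.
    binom = [[[1]]]
    for s in range(1, n + 1):
        prev, row = binom[-1], []
        for k in range(s + 1):
            coeffs = [0] * (k * (s - k) + 1)
            if k >= 1:
                for d, c in enumerate(prev[k - 1]):
                    coeffs[d] += c
            if k <= s - 1:
                for d, c in enumerate(prev[k]):
                    coeffs[d + k] += c
            row.append(coeffs)
        binom.append(row)
    # G[s]: coefficients of G_s^{(j)}(q); with a one-letter alphabet (j = 1), G = 1.
    G = [[1] for _ in range(n + 1)]
    for _ in range(m - 1):  # each pass adds one letter to the alphabet
        new_G = []
        for s in range(n + 1):
            total = [0] * (s * (s - 1) // 2 + 1)
            for k in range(s + 1):
                for d1, c1 in enumerate(binom[s][k]):
                    for d2, c2 in enumerate(G[s - k]):
                        if c2:
                            total[d1 + d2] += c1 * c2
            new_G.append(total)
        G = new_G
    return [c / m**n for c in G[n]]

def local_limit_error(n, m):
    """max over k in the support of |sigma P(G=k) - Gaussian term| in (1.14)."""
    pmf = galois_pmf(n, m)
    mu = n * (n - 1) * (m - 1) / (4 * m)                            # (1.7)
    var = n * (n - 1) * (2 * n + 5) * (m * m - 1) / (72 * m * m)    # (1.8)
    sigma = math.sqrt(var)
    return max(
        abs(sigma * p - math.exp(-((k - mu) ** 2) / (2 * var)) / math.sqrt(2 * math.pi))
        for k, p in enumerate(pmf)
    )

if __name__ == "__main__":
    for n, m in [(10, 2), (20, 2), (30, 2), (20, 3)]:
        print(n, m, local_limit_error(n, m))
```

The maximum is taken only over $0 \le k \le n(n-1)/2$; outside this range the probability is zero and the Gaussian term is small for the values of $n$ used here.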

Proofs are given in Sections 5 and 6.

Remark 1.6. The name (generalized) Galois numbers comes from the following algebraic interpretation, see [4], [17], [10, Chapter 7], [15, Proposition 1.3.18], which, however, will not be important in the present paper.

If $q$ is a prime power and $V$ an $n$-dimensional vector space over the Galois field $\mathbb{F}_q$ with $q$ elements, then it is not difficult to see that $\binom{n}{k_1,\dots,k_m}_q$ is the number of flags $\{0\} \subseteq V_1 \subseteq \dots \subseteq V_m = V$, where $V_i$ is a subspace of dimension $k_1 + \dots + k_i$. Hence, $G_n^{(m)}(q)$ is the total number of such flags of fixed length $m$ in $V = \mathbb{F}_q^n$. In particular, the Galois number $G_n(q) = G_n^{(2)}(q)$ is the number of subspaces of $\mathbb{F}_q^n$.

2. Inversions 

If $w = w_1 \cdots w_n$ is a word with letters from an ordered alphabet $\mathcal{A}$, then the number of inversions in $w$ is the number of pairs $(i,j)$ with $i < j$ and $w_i > w_j$; we denote this number by $\mathrm{Inv}(w)$. Using the notation $\mathbf{1}\{\mathcal{E}\}$ for the indicator of an event $\mathcal{E}$, we thus have

$$\mathrm{Inv}(w) = \sum_{1 \le i < j \le n} \mathbf{1}\{w_i > w_j\}. \tag{2.1}$$

With the alphabet $\mathcal{A} = \{1,\dots,m\}$, it is well-known (and not difficult to see) that the q-multinomial coefficient $\binom{n}{n_1,\dots,n_m}_q$, where $n_1 + \dots + n_m = n$, is the generating function of the number of inversions in words consisting of $n_1$ 1's, ..., $n_m$ $m$'s, in the sense that if $a_{n_1,\dots,n_m}(\ell)$ is the number of such words with exactly $\ell$ inversions, then

$$\binom{n}{n_1,\dots,n_m}_q = \sum_{\ell=0}^{\infty} a_{n_1,\dots,n_m}(\ell)\, q^\ell, \tag{2.2}$$
see [1, Theorem 3.6].

Summing over all $n_1,\dots,n_m$ with $n_1 + \dots + n_m = n$, we immediately obtain the following from (1.4) and (2.2).

Theorem 2.1. $G_n^{(m)}(q)$ is the generating function of the number of inversions in words of length $n$ in the alphabet $\{1,\dots,m\}$, in the sense that if $A_n^{(m)}(\ell)$ is the number of such words with exactly $\ell$ inversions, then

$$G_n^{(m)}(q) = \sum_{\ell=0}^{\infty} A_n^{(m)}(\ell)\, q^\ell. \tag{2.3}$$
□

By the definition of the random variable $G_{n,m}$, (2.3) is equivalent to

$$\mathbb{P}(G_{n,m} = \ell) = A_n^{(m)}(\ell)/m^n. \tag{2.4}$$

This can be formulated as follows, yielding our first construction of a random variable $G_{n,m}$.

Theorem 2.2. Let $W_{n,m}$ be a uniformly random word of length $n$ in the alphabet $\{1,\dots,m\}$. Then the number of inversions $\mathrm{Inv}(W_{n,m})$ has the distribution $\mathcal{G}_{n,m}$. In other words, $G_{n,m} \overset{d}{=} \mathrm{Inv}(W_{n,m})$. □

We can thus choose $G_{n,m} := \mathrm{Inv}(W_{n,m})$. (Recall that we have defined $G_{n,m}$ to be an arbitrary random variable with the desired distribution.)
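Theorem 2.1 can be checked by brute force for small $n$ and $m$. The sketch below computes the coefficients of $G_n^{(m)}(q)$ directly from the definitions (1.2) and (1.4), using exact polynomial division of q-factorials, and compares them with the inversion counts $A_n^{(m)}(\ell)$ obtained by enumerating all $m^n$ words. All function names are ours and purely illustrative.

```python
from collections import Counter
from itertools import product

def poly_mul(a, b):
    """Product of two polynomials given as coefficient lists (lowest degree first)."""
    out = [0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out

def poly_div_exact(a, b):
    """Divide a by b, assuming b has constant term 1 and divides a exactly."""
    a = list(a)
    quotient = []
    for i in range(len(a) - len(b) + 1):
        coeff = a[i]
        quotient.append(coeff)
        for j, y in enumerate(b):
            a[i + j] -= coeff * y
    assert not any(a), "division was not exact"
    return quotient

def q_factorial(k):
    """[k]!_q = prod_{j=1}^{k} (1 + q + ... + q^{j-1}) as a coefficient list."""
    result = [1]
    for j in range(1, k + 1):
        result = poly_mul(result, [1] * j)
    return result

def q_multinomial(parts):
    """Coefficient list of the q-multinomial coefficient (1.2)."""
    result = q_factorial(sum(parts))
    for k in parts:
        result = poly_div_exact(result, q_factorial(k))
    return result

def galois_coeffs(n, m):
    """Coefficients of G_n^{(m)}(q) from (1.4), padded to degree n(n-1)/2."""
    total = [0] * (n * (n - 1) // 2 + 1)
    for parts in product(range(n + 1), repeat=m):
        if sum(parts) == n:
            for d, c in enumerate(q_multinomial(parts)):
                total[d] += c
    return total

def inversions(word):
    """Inv(w) as in (2.1)."""
    n = len(word)
    return sum(1 for i in range(n) for j in range(i + 1, n) if word[i] > word[j])

def inversion_counts(n, m):
    """A_n^{(m)}(l) for l = 0..n(n-1)/2, by enumerating all m^n words."""
    counts = Counter(inversions(w) for w in product(range(1, m + 1), repeat=n))
    return [counts[l] for l in range(n * (n - 1) // 2 + 1)]

if __name__ == "__main__":
    print(galois_coeffs(4, 3))
    print(inversion_counts(4, 3))
```

For example, for $n = m = 2$ the four words 11, 12, 21, 22 have 0, 0, 1, 0 inversions, in agreement with $G_2^{(2)}(q) = 3 + q$.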

If we write the random word $W_{n,m}$ as $X_1 \cdots X_n$, we have $X_1,\dots,X_n$ i.i.d. (independent and identically distributed) with the uniform distribution on $\{1,\dots,m\}$, and using (2.1), Theorem 2.2 may be reformulated as follows.

Corollary 2.3. Let $\{X_i\}_{i=1}^{n}$ be i.i.d. random variables, with every $X_i$ uniformly distributed on $\{1,\dots,m\}$, and let

$$V_{n,m} := \sum_{1 \le i < j \le n} \mathbf{1}\{X_i > X_j\}. \tag{2.5}$$

Then $V_{n,m} \sim \mathcal{G}_{n,m}$. In other words, $G_{n,m} \overset{d}{=} V_{n,m}$. □

Let $N_k := \#\{i \le n : X_i = k\}$ be the number of occurrences of the letter $k$ in the random string $W_{n,m} = X_1 \cdots X_n$. Then $(N_1,\dots,N_m)$ has a multinomial distribution with $\mathbb{E}\, N_k = n/m$, and it is well known that if we keep $m$ fixed, $n^{-1/2}(N_k - \mathbb{E}\, N_k)_{k=1}^{m} \xrightarrow{d} (Z_k)_{k=1}^{m}$ as $n \to \infty$, where $Z_1,\dots,Z_m$ are jointly normal with means $\mathbb{E}\, Z_k = 0$, variances $\operatorname{Var} Z_k = (m-1)/m^2$ and covariances $\operatorname{Cov}(Z_k, Z_l) = -1/m^2$ ($k \ne l$). By Theorem 1.3, $V_{n,m} \overset{d}{=} G_{n,m}$ has an asymptotic normal distribution, and this extends to joint asymptotic normality of $V_{n,m}$ and $N_1,\dots,N_m$.



Theorem 2.4. For fixed $m$, as $n \to \infty$,

$$\Bigl(\frac{V_{n,m} - \mathbb{E}\, V_{n,m}}{n^{3/2}}, \frac{N_1 - \mathbb{E}\, N_1}{n^{1/2}}, \dots, \frac{N_m - \mathbb{E}\, N_m}{n^{1/2}}\Bigr) \xrightarrow{d} (Z^*, Z_1, \dots, Z_m), \tag{2.6}$$

where $Z^*, Z_1, \dots, Z_m$ are jointly normal with means 0, $\operatorname{Var} Z^* = (m^2-1)/36m^2$ as in (1.11), $Z_1,\dots,Z_m$ have the variances and covariances given above and $Z^*$ is independent of $Z_1,\dots,Z_m$.

The proof is given in Section 5.

3. A U-statistic

Let $\{X_i\}_{i=1}^{\infty}$ and $\{Y_i\}_{i=1}^{\infty}$ be independent random variables, with every $X_i$ uniformly distributed on $\{1,\dots,m\}$ and every $Y_i$ uniformly distributed on $[0,1]$. (Any common continuous distribution of $Y_i$ would yield the same result.)

Fix $n \ge 1$. The values $Y_1,\dots,Y_n$ are a.s. distinct, and can thus be ordered as $Y_{\sigma(1)} < \dots < Y_{\sigma(n)}$ for some (unique) permutation $\sigma$ of $\{1,\dots,n\}$. Let $W_{n,m}$ be the word $X_{\sigma(1)} \cdots X_{\sigma(n)}$. Since $\{X_i\}_{i=1}^{n}$ and $\{Y_i\}_{i=1}^{n}$ are independent, $W_{n,m}$ has the same distribution as $X_1 \cdots X_n$, and is thus a uniformly random word in $\{1,\dots,m\}^n$. Consequently, Theorem 2.2 yields $\mathrm{Inv}(W_{n,m}) \sim \mathcal{G}_{n,m}$. Moreover, since $i < j \iff Y_{\sigma(i)} < Y_{\sigma(j)}$,

$$\mathrm{Inv}(W_{n,m}) = \sum_{1 \le i < j \le n} \mathbf{1}\{X_{\sigma(i)} > X_{\sigma(j)}\} = \sum_{i,j=1}^{n} \mathbf{1}\{X_{\sigma(i)} > X_{\sigma(j)} \text{ and } i < j\}$$
$$= \sum_{i,j=1}^{n} \mathbf{1}\{X_{\sigma(i)} > X_{\sigma(j)} \text{ and } Y_{\sigma(i)} < Y_{\sigma(j)}\}$$
$$= \sum_{k,l=1}^{n} \mathbf{1}\{X_k > X_l\}\, \mathbf{1}\{Y_k < Y_l\}.$$

We have shown the following, yielding our second construction of $G_{n,m}$.

Theorem 3.1. Let $X_i$ and $Y_i$ be as above, and define the random variable

$$U_{n,m} := \sum_{i,j=1}^{n} \mathbf{1}\{X_i > X_j\}\, \mathbf{1}\{Y_i < Y_j\}. \tag{3.1}$$

Then $U_{n,m} \sim \mathcal{G}_{n,m}$. In other words, $G_{n,m} \overset{d}{=} U_{n,m}$. □
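The identity behind Theorem 3.1 is deterministic: for any letters $x_1,\dots,x_n$ and any distinct values $y_1,\dots,y_n$, the double sum in (3.1) equals the number of inversions of the word obtained by reading the letters in increasing order of the $y_i$. A minimal sketch illustrating this (the function names are ours; the uniform sampling at the end is only for illustration):

```python
import random

def inversions(word):
    """Inv(w) as in (2.1)."""
    n = len(word)
    return sum(1 for i in range(n) for j in range(i + 1, n) if word[i] > word[j])

def u_statistic(xs, ys):
    """The double sum (3.1): pairs (i, j) with xs[i] > xs[j] and ys[i] < ys[j]."""
    n = len(xs)
    return sum(1 for i in range(n) for j in range(n)
               if xs[i] > xs[j] and ys[i] < ys[j])

def reordered_word(xs, ys):
    """The word X_sigma(1) ... X_sigma(n), where sigma sorts the (distinct) ys."""
    sigma = sorted(range(len(xs)), key=lambda i: ys[i])
    return [xs[i] for i in sigma]

if __name__ == "__main__":
    n, m = 8, 3
    xs = [random.randint(1, m) for _ in range(n)]
    ys = [random.random() for _ in range(n)]   # a.s. distinct
    print(u_statistic(xs, ys), inversions(reordered_word(xs, ys)))
```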

Let $Z_i := (X_i, Y_i)$; this yields a sequence of i.i.d. random vectors taking values in $S := \{1,\dots,m\} \times [0,1]$. Define the functions $h, h^* : S^2 \to \mathbb{R}$ by

$$h\bigl((x_1,y_1),(x_2,y_2)\bigr) := \mathbf{1}\{x_1 > x_2\}\, \mathbf{1}\{y_1 < y_2\}, \tag{3.2}$$

$$h^*\bigl((x_1,y_1),(x_2,y_2)\bigr) := h\bigl((x_1,y_1),(x_2,y_2)\bigr) + h\bigl((x_2,y_2),(x_1,y_1)\bigr). \tag{3.3}$$

Thus $h^*$ is symmetric and (3.1) can be written

$$U_{n,m} = \sum_{i,j=1}^{n} h(Z_i, Z_j) = \sum_{1 \le i < j \le n} h^*(Z_i, Z_j), \tag{3.4}$$

which shows that $U_{n,m}$ is (for fixed $m$) a U-statistic [7].
4. Lattice paths and Ferrers diagrams

In this section we consider the special case $m = 2$. In this case, there is an alternative combinatorial description of the Gaussian binomial coefficients using lattice paths instead of inversions, see Pólya [12]. Indeed, consider lattice paths in the first quadrant, starting at the origin and containing $n$ unit steps East or North. There are $2^n$ such paths, and they may be encoded by the $2^n$ words of length $n$ with the alphabet $\{E, N\}$. The area under each horizontal step equals the number of previous vertical steps, so by summing, we see that the area under the path equals the number of inversions in the corresponding word, where we use the ordering $E < N$.

Consequently, Theorem 2.2 yields the following.

Theorem 4.1. Let $\theta(n)$ be the area under a uniformly random lattice path (of the type above) of length $n$. Then $\theta(n) \sim \mathcal{G}_{n,2}$. In other words, $G_{n,2} \overset{d}{=} \theta(n)$.
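The path encoding can be verified directly for short paths. In the sketch below (function names ours), a path is a string over `E` and `N`; the area under the path is accumulated column by column, and compared with the inversion count for the ordering $E < N$. The area to the left of the path is also computed; the two areas add up to the area of the bounding rectangle.

```python
from itertools import product

def area_below(path):
    """Area between the path and the x-axis; path is a string of 'E'/'N' steps."""
    height, area = 0, 0
    for step in path:
        if step == "N":
            height += 1
        else:
            area += height  # an East step adds a column of the current height
    return area

def area_left(path):
    """Area between the path and the y-axis."""
    x, area = 0, 0
    for step in path:
        if step == "E":
            x += 1
        else:
            area += x  # a North step adds a row of the current width
    return area

def inversions(path):
    """Inversions for the ordering E < N: pairs i < j with path[i] = 'N', path[j] = 'E'."""
    n = len(path)
    return sum(1 for i in range(n) for j in range(i + 1, n)
               if path[i] == "N" and path[j] == "E")

if __name__ == "__main__":
    for steps in product("EN", repeat=4):
        path = "".join(steps)
        print(path, area_below(path), inversions(path), area_left(path))
```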



The random variable $\theta(n)$ was studied by Takács [16], who found its mean and variance and proved a central limit theorem and a local limit theorem (our Theorems 1.1, 1.3 and 1.5 for $m = 2$).

By symmetry, we may instead consider the area $\theta'(n)$ between the path and the y-axis. This area can be regarded as a Ferrers diagram; if the path ends at $(s_1, s_2)$, then the height (number of non-empty rows) $h$ and width $w$ of the Ferrers diagram satisfy $h \le s_2$ and $w \le s_1$, and there is a bijection between all paths ending at $(s_1, s_2)$ and all such Ferrers diagrams. (Note the bijection between such Ferrers diagrams with a given area $N$ and the partitions of $N$ into at most $s_2$ parts, each at most $s_1$; see [1, Theorem 3.5].)

Alternatively, by adding an extra row and column, we obtain a Ferrers diagram with height $s_2 + 1$ and width $s_1 + 1$; its right boundary consists of a path from $(-1, 0)$ to $(s_1, s_2 + 1)$, beginning with a horizontal step and ending with a vertical. Moreover, there is a bijection between all paths ending at $(s_1, s_2)$ and all such Ferrers diagrams. We further see that the area of this Ferrers diagram equals $\theta' + s_1 + s_2 + 1$, where $\theta'$ is the area between the (original) path and the y-axis.

The semiperimeter of a Ferrers diagram equals its height plus width, and we thus have obtained a bijection between all Ferrers diagrams with semiperimeter $n + 2$ and all (north-east) lattice paths of length $n$. This bijection gives a correspondence between uniformly random Ferrers diagrams with semiperimeter $n + 2$ and uniformly random lattice paths of length $n$, yielding the following theorem.

Theorem 4.2. Let $A_n$ be the area of a uniformly random Ferrers diagram with semiperimeter $n + 2$. Then $A_n - n - 1 \sim \mathcal{G}_{n,2}$. In other words, $G_{n,2} \overset{d}{=} A_n - n - 1$.

Proof. If $\theta'(n)$ is the area between the corresponding random lattice path and the y-axis, then the arguments above show that

$$A_n = \theta'(n) + n + 1 \overset{d}{=} \theta(n) + n + 1$$

and the result follows by Theorem 4.1. □

Corollary 4.3. The random variable $A_n$ has mean and variance

$$\mathbb{E}\, A_n = \mathbb{E}\, G_{n,2} + n + 1 = \frac{n^2 + 7n + 8}{8}, \tag{4.1}$$

$$\operatorname{Var} A_n = \operatorname{Var} G_{n,2} = \frac{n(n-1)(2n+5)}{96}. \tag{4.2}$$

Proof. By Theorems 4.2 and 1.1. □
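Theorem 4.2 and Corollary 4.3 can be checked by exhaustive enumeration for small $n$. The sketch below represents a Ferrers diagram as a partition (a non-increasing tuple of positive row lengths), so that its height is the number of parts and its width is the largest part; this representation and all function names are our choices for illustration.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

def ferrers_diagrams(semiperimeter):
    """Yield all partitions with (number of parts) + (largest part) == semiperimeter."""
    def extend(rows, rows_left):
        if rows_left == 0:
            yield tuple(rows)
            return
        for part in range(1, rows[-1] + 1):
            yield from extend(rows + [part], rows_left - 1)
    for width in range(1, semiperimeter):
        height = semiperimeter - width
        yield from extend([width], height - 1)

def shifted_area_counts(n):
    """Counts of A_n - n - 1 over all Ferrers diagrams with semiperimeter n + 2."""
    return Counter(sum(rows) - n - 1 for rows in ferrers_diagrams(n + 2))

def inversion_counts(n):
    """Counts of Inv(w) over all words of length n in the alphabet {1, 2}."""
    return Counter(
        sum(1 for i in range(n) for j in range(i + 1, n) if w[i] > w[j])
        for w in product((1, 2), repeat=n)
    )

def mean_area(n):
    """Exact mean of A_n over the uniform distribution on these diagrams."""
    areas = [sum(rows) for rows in ferrers_diagrams(n + 2)]
    return Fraction(sum(areas), len(areas))

if __name__ == "__main__":
    for n in range(1, 7):
        print(n, mean_area(n), Fraction(n * n + 7 * n + 8, 8))
```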

Theorem 1.3 yields the central limit theorem

$$\frac{A_n - \mathbb{E}\, A_n}{\operatorname{Var}(A_n)^{1/2}} \xrightarrow{d} N(0,1); \tag{4.3}$$

by (4.1)-(4.2), this can also be written as

$$\frac{A_n - n^2/8}{n^{3/2}} \xrightarrow{d} N\Bigl(0, \frac{1}{48}\Bigr), \tag{4.4}$$

which was proved by other methods by Schwerdtfeger [14]. Furthermore, Schwerdtfeger [14] showed that if $H_n$ is the height of the Ferrers diagram, then there is joint convergence of the normalised variables $A_n$ and $H_n$ to $(\zeta_1, \zeta_2)$, where $\zeta_1, \zeta_2$ are independent standard normal variables. The asymptotic normality of $H_n$ is immediate, since $H_n - 1$ is the y-coordinate of the endpoint of the corresponding lattice path, and thus $H_n - 1$ has the binomial distribution $\mathrm{Bi}(n, 1/2)$. The joint convergence follows by Theorem 2.4.

5. Proofs of Theorems 1.1-1.4 and 2.4

We will base most of the proofs on the representation in (3.1)-(3.4). (It is also possible to use (2.5), see Remark 5.3 and the proof of Theorem 2.4; (2.5) is simpler in some ways, but we prefer the symmetry in (3.1)-(3.4).)

We use the notations, with $Z_i$, $h$, $h^*$ as in Section 3, see (3.2)-(3.3),

$$I_{ij} := h(Z_i, Z_j) = \mathbf{1}\{X_i > X_j\}\, \mathbf{1}\{Y_i < Y_j\}, \tag{5.1}$$

$$I^*_{ij} := h^*(Z_i, Z_j) = I_{ij} + I_{ji}. \tag{5.2}$$

Thus (3.4) can be written

$$U_{n,m} = \sum_{i,j=1}^{n} I_{ij} = \sum_{1 \le i < j \le n} I^*_{ij}. \tag{5.3}$$
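Proof of Theorem 1.1. By symmetry and the independence of $I^*_{ij}$ and $I^*_{kl}$ when $\{i,j\}$ and $\{k,l\}$ are disjoint, (5.3) implies

$$\mathbb{E}\, G_{n,m} = \binom{n}{2} \mathbb{E}\, I^*_{12} = n(n-1)\, \mathbb{E}\, I_{12}, \tag{5.4}$$

$$\operatorname{Var} G_{n,m} = \binom{n}{2} \operatorname{Var} I^*_{12} + n(n-1)(n-2) \operatorname{Cov}(I^*_{12}, I^*_{13}). \tag{5.5}$$

Clearly,

$$\mathbb{E}\, I_{ij} = \mathbb{P}(X_i > X_j)\, \mathbb{P}(Y_i < Y_j) = \frac{\binom{m}{2}}{m^2} \cdot \frac{1}{2} = \frac{m-1}{4m} \tag{5.6}$$

and

$$\mathbb{E}\, I^*_{ij} = 2\, \mathbb{E}\, I_{ij} = \frac{m-1}{2m} = \frac{1}{2} - \frac{1}{2m}; \tag{5.7}$$

any of these yields (1.7) by (5.4).

Since $I^*_{ij}$ is 0/1-valued, it follows from (5.7) also that

$$\operatorname{Var} I^*_{ij} = \mathbb{E}\, I^*_{ij}\,(1 - \mathbb{E}\, I^*_{ij}) = \frac{1}{4}\Bigl(1 - \frac{1}{m^2}\Bigr). \tag{5.8}$$

Furthermore, again using symmetry,

$$\mathbb{E}(I^*_{12} I^*_{13}) = 2\, \mathbb{E}(I_{12} I_{13}) + 2\, \mathbb{E}(I_{21} I_{13})$$
$$= 2\, \mathbb{P}(X_1 > X_2, X_3)\, \mathbb{P}(Y_1 < Y_2, Y_3) + 2\, \mathbb{P}(X_2 > X_1 > X_3)\, \mathbb{P}(Y_2 < Y_1 < Y_3)$$
$$= 2\, \frac{\sum_{i=1}^{m} (i-1)^2}{m^3} \cdot \frac{1}{3} + 2\, \frac{\binom{m}{3}}{m^3} \cdot \frac{1}{6} = \frac{m(m-1)(2m-1)}{9m^3} + \frac{m(m-1)(m-2)}{18m^3}$$
$$= \frac{(m-1)(5m-4)}{18m^2}$$

and hence

$$\operatorname{Cov}(I^*_{12}, I^*_{13}) = \mathbb{E}(I^*_{12} I^*_{13}) - (\mathbb{E}\, I^*_{12})^2 = \frac{(m-1)(5m-4)}{18m^2} - \frac{(m-1)^2}{4m^2} = \frac{(m-1)(m+1)}{36m^2}. \tag{5.9}$$

The variance formula (1.8) follows from (5.5), (5.8) and (5.9). □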
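The formulas (1.7) and (1.8) are exact for every $n$ and $m$, so they can be confirmed with rational arithmetic by enumerating all words for small parameters. A minimal sketch (function names ours), using the representation $G_{n,m} \overset{d}{=} \mathrm{Inv}(W_{n,m})$ from Theorem 2.2:

```python
from fractions import Fraction
from itertools import product

def inversions(word):
    """Inv(w) as in (2.1)."""
    n = len(word)
    return sum(1 for i in range(n) for j in range(i + 1, n) if word[i] > word[j])

def exact_moments(n, m):
    """Exact mean and variance of Inv(W) for a uniformly random word W in {1..m}^n."""
    values = [inversions(w) for w in product(range(1, m + 1), repeat=n)]
    mean = Fraction(sum(values), len(values))
    second = Fraction(sum(v * v for v in values), len(values))
    return mean, second - mean ** 2

def theorem_1_1(n, m):
    """Mean (1.7) and variance (1.8) as exact fractions."""
    mean = Fraction(n * (n - 1) * (m - 1), 4 * m)
    var = Fraction(n * (n - 1) * (2 * n + 5) * (m * m - 1), 72 * m * m)
    return mean, var

if __name__ == "__main__":
    for n, m in [(2, 2), (4, 3), (5, 4)]:
        print(n, m, exact_moments(n, m), theorem_1_1(n, m))
```

For $n = m = 2$ both computations give mean $1/4$ and variance $3/16$.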

Proof of Theorem 1.2. Consider the random word $W_{n,m} = X_1 \cdots X_n$ in Theorem 2.2. If we condition on the letters $X_1,\dots,X_n$ being distinct, then the number of inversions $\mathrm{Inv}(W_{n,m})$ has the same distribution as the number $G_n$ of inversions in a random permutation. Hence, for any set $A \subseteq \mathbb{N}$,

$$\mathbb{P}\bigl(\mathrm{Inv}(W_{n,m}) \in A \mid X_1,\dots,X_n \text{ distinct}\bigr) = \mathbb{P}(G_n \in A)$$

and thus

$$\bigl|\mathbb{P}(\mathrm{Inv}(W_{n,m}) \in A) - \mathbb{P}(G_n \in A)\bigr| \le \mathbb{P}(X_1,\dots,X_n \text{ not distinct}) \le \binom{n}{2} \mathbb{P}(X_1 = X_2) = \frac{\binom{n}{2}}{m} \to 0$$

as $m \to \infty$, and thus $G_{n,m} \overset{d}{=} \mathrm{Inv}(W_{n,m}) \xrightarrow{d} G_n$. □

Remark 5.1. We have actually proved that the total variation distance $d_{TV}(G_{n,m}, G_n) \le \binom{n}{2}/m$. Moreover, the bound can be improved to

$$d_{TV}(G_{n,m}, G_n) \le \mathbb{P}(X_1,\dots,X_n \text{ not distinct}) = 1 - (m)_n/m^n,$$

where $(m)_n := m!/(m-n)!$.
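The two bounds in Remark 5.1 can be compared with the exact total variation distance for small $n$ and $m$. The sketch below (function names ours) computes both laws by enumeration with exact fractions and uses $d_{TV} = \frac12 \sum_k |p_k - q_k|$; `math.perm(m, n)` is the falling factorial $(m)_n$ and equals 0 when $n > m$.

```python
from collections import Counter
from fractions import Fraction
from itertools import permutations, product
from math import comb, perm

def inversions(word):
    """Inv(w) as in (2.1)."""
    n = len(word)
    return sum(1 for i in range(n) for j in range(i + 1, n) if word[i] > word[j])

def law(words):
    """Exact distribution of Inv over a finite collection of equally likely words."""
    words = list(words)
    counts = Counter(inversions(w) for w in words)
    return {k: Fraction(c, len(words)) for k, c in counts.items()}

def tv_distance(n, m):
    """Exact total variation distance between G_{n,m} and G_n."""
    p = law(product(range(m), repeat=n))   # uniformly random word
    q = law(permutations(range(n)))        # uniformly random permutation
    return sum(abs(p.get(k, 0) - q.get(k, 0)) for k in set(p) | set(q)) / 2

def not_distinct_probability(n, m):
    """P(X_1, ..., X_n not distinct) = 1 - (m)_n / m^n."""
    return 1 - Fraction(perm(m, n), m ** n)

if __name__ == "__main__":
    n = 3
    for m in range(2, 8):
        print(m, tv_distance(n, m), not_distinct_probability(n, m), Fraction(comb(n, 2), m))
```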



Proof of Theorems 1.3 and 1.4. The two versions in each theorem are equivalent by (1.8), so it suffices to prove, for example, (1.11) and (1.13).

The central limit theorem Theorem 1.3 follows immediately from Hoeffding's central limit theorem for U-statistics [7] without any further calculations. Moreover, we shall see that the decomposition method used by Hoeffding [7] also yields Theorem 1.4; we therefore do the decomposition explicitly.

The idea is to decompose each term $I^*_{ij}$ as

$$I^*_{ij} = \mu + \xi_i + \xi_j + \eta_{ij}, \tag{5.10}$$

where $\mu := \mathbb{E}\, I^*_{ij}$,

$$\xi_i := \mathbb{E}(I^*_{ij} - \mu \mid Z_i) = \mathbb{E}(I^*_{ij} \mid X_i, Y_i) - \mu \tag{5.11}$$

and $\eta_{ij}$ is defined by (5.10). Then the random variables $\xi_i$ ($1 \le i \le n$) and $\eta_{ij}$ ($1 \le i < j \le n$) have mean 0 and are orthogonal (in $L^2$), so they are uncorrelated. In particular,

$$1 \ge \operatorname{Var} I^*_{ij} = \operatorname{Var} \xi_i + \operatorname{Var} \xi_j + \operatorname{Var} \eta_{ij}. \tag{5.12}$$

Moreover, $\xi_i = g(Z_i)$ for some function $g$, and thus the variables $\xi_i$ are i.i.d. By summing (5.10), we obtain by (5.3) a corresponding decomposition of $U_{n,m}$:

$$U_{n,m} = \binom{n}{2} \mu + (n-1) \sum_{i=1}^{n} \xi_i + \sum_{1 \le i < j \le n} \eta_{ij}. \tag{5.13}$$

Hence,

$$\frac{U_{n,m} - \mathbb{E}\, U_{n,m}}{n^{3/2}} = \frac{n-1}{n}\, n^{-1/2} \sum_{i=1}^{n} \xi_i + n^{-3/2} R, \tag{5.14}$$

where $R := \sum_{1 \le i < j \le n} \eta_{ij}$. Since the variables $\eta_{ij}$ are uncorrelated, and $\operatorname{Var} \eta_{ij} \le 1$ by (5.12), we have

$$\mathbb{E}\, R^2 = \operatorname{Var} R = \sum_{1 \le i < j \le n} \operatorname{Var} \eta_{ij} \le \binom{n}{2} \le n^2, \tag{5.15}$$

and thus $\mathbb{E}(n^{-3/2} R)^2 \to 0$. Hence, the last term in (5.14) is a small remainder term that can be ignored when $n \to \infty$. Furthermore, the decomposition
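(5.13) yields the variance decomposition

$$\operatorname{Var} U_{n,m} = (n-1)^2 \sum_{i=1}^{n} \operatorname{Var} \xi_i + \sum_{1 \le i < j \le n} \operatorname{Var} \eta_{ij} = n(n-1)^2 \operatorname{Var} \xi_1 + \binom{n}{2} \operatorname{Var} \eta_{12} \sim n^3 \operatorname{Var} \xi_1 \tag{5.16}$$

as $n \to \infty$, and thus by (1.8),

$$\operatorname{Var} \xi_1 = \frac{1}{36}\Bigl(1 - \frac{1}{m^2}\Bigr). \tag{5.17}$$

For fixed $m$ (Theorem 1.3), the standard central limit theorem for sums of i.i.d. random variables now shows that

$$n^{-1/2} \sum_{i=1}^{n} \xi_i \xrightarrow{d} N(0, \operatorname{Var} \xi_1) = N\Bigl(0, \frac{m^2-1}{36m^2}\Bigr), \tag{5.18}$$

and thus (1.11) follows from (5.14).

For $m \to \infty$ (Theorem 1.4), we have $\operatorname{Var} \xi_1 \to 1/36$ by (5.17); moreover, the random variables $\xi_i$ are uniformly bounded (by 1), and thus the central limit theorem with e.g. Lyapounov's condition [5, Theorem 7.2.2] applies and shows that

$$n^{-1/2} \sum_{i=1}^{n} \xi_i \xrightarrow{d} N\Bigl(0, \frac{1}{36}\Bigr), \tag{5.19}$$

and thus (1.13) follows from (5.14). □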

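Remark 5.2. It is interesting to do the decomposition (5.10) explicitly. Using the centred variables

$$X'_i := X_i - \mathbb{E}\, X_i = X_i - \frac{m+1}{2}, \tag{5.20}$$

$$Y'_i := Y_i - \mathbb{E}\, Y_i = Y_i - \frac{1}{2}, \tag{5.21}$$

we have by (5.1)

$$\mathbb{E}(I_{ij} \mid X_i, Y_i) = \frac{X_i - 1}{m}(1 - Y_i) = \frac{(m-1)/2 + X'_i}{m}\Bigl(\frac{1}{2} - Y'_i\Bigr), \tag{5.22}$$

$$\mathbb{E}(I_{ji} \mid X_i, Y_i) = \frac{m - X_i}{m}\, Y_i = \frac{(m-1)/2 - X'_i}{m}\Bigl(\frac{1}{2} + Y'_i\Bigr), \tag{5.23}$$

and thus, using (5.2) and (5.7),

$$\xi_i := \mathbb{E}(I^*_{ij} \mid X_i, Y_i) - \mathbb{E}\, I^*_{ij} = -\frac{2}{m} X'_i Y'_i. \tag{5.24}$$

Hence, the decomposition is

$$I^*_{ij} = \frac{m-1}{2m} - \frac{2}{m} X'_i Y'_i - \frac{2}{m} X'_j Y'_j + \eta_{ij} \tag{5.25}$$

and

$$U_{n,m} = \binom{n}{2} \frac{m-1}{2m} - \frac{2(n-1)}{m} \sum_{i=1}^{n} X'_i Y'_i + \sum_{1 \le i < j \le n} \eta_{ij}. \tag{5.26}$$

Note also that (5.17) follows from (5.24), and then (5.12) yields, using (5.8),

$$\operatorname{Var} \eta_{ij} = \operatorname{Var} I^*_{ij} - 2 \operatorname{Var} \xi_i = \frac{1}{4}\Bigl(1 - \frac{1}{m^2}\Bigr) - \frac{1}{18}\Bigl(1 - \frac{1}{m^2}\Bigr) = \frac{7}{36}\Bigl(1 - \frac{1}{m^2}\Bigr), \tag{5.27}$$

which together with (5.17) and (5.16) yields another proof of (1.8).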



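Remark 5.3. It is also interesting to do the corresponding orthogonal decomposition of $V_{n,m}$ in (2.5). We have, similarly to (5.10),

$$\mathbf{1}\{X_i > X_j\} = \mu' + \xi'_i + \xi''_j + \eta'_{ij}, \tag{5.28}$$

where $\mu' := \mathbb{P}(X_i > X_j) = \frac{m-1}{2m}$, and, with $X'_i$ as in (5.20),

$$\xi'_i := \mathbb{P}(X_i > X_j \mid X_i) - \mu' = \frac{X'_i}{m}, \tag{5.29}$$

$$\xi''_j := \mathbb{P}(X_i > X_j \mid X_j) - \mu' = -\frac{X'_j}{m}, \tag{5.30}$$

and $\eta'_{ij}$ is defined by (5.28). Summing, we get

$$V_{n,m} = \mathbb{E}\, V_{n,m} + \sum_{i=1}^{n} (n-i)\, \xi'_i + \sum_{j=1}^{n} (j-1)\, \xi''_j + \sum_{1 \le i < j \le n} \eta'_{ij}$$
$$= \mathbb{E}\, V_{n,m} + \frac{1}{m} \sum_{i=1}^{n} (n+1-2i)\, X'_i + \sum_{1 \le i < j \le n} \eta'_{ij}. \tag{5.31}$$

Straightforward calculations show that

$$\operatorname{Var} X'_i = \frac{1}{12}(m^2 - 1), \tag{5.32}$$

$$\operatorname{Var}\bigl(\mathbf{1}\{X_i > X_j\}\bigr) = \frac{1}{4}\Bigl(1 - \frac{1}{m^2}\Bigr), \tag{5.33}$$

and, by (5.28),

$$\operatorname{Var} \eta'_{ij} = \operatorname{Var}\bigl(\mathbf{1}\{X_i > X_j\}\bigr) - \operatorname{Var} \xi'_i - \operatorname{Var} \xi''_j = \frac{1}{12}\Bigl(1 - \frac{1}{m^2}\Bigr). \tag{5.34}$$

Hence, (5.31) yields

$$\operatorname{Var} V_{n,m} = \frac{1}{m^2} \sum_{i=1}^{n} (n+1-2i)^2 \operatorname{Var} X'_i + \sum_{1 \le i < j \le n} \operatorname{Var} \eta'_{ij} = \frac{n(n-1)(n+1)}{36}\Bigl(1 - \frac{1}{m^2}\Bigr) + \frac{n(n-1)}{24}\Bigl(1 - \frac{1}{m^2}\Bigr), \tag{5.35}$$

which gives yet another proof of (1.8).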
We can also prove Theorems 1.3 and 1.4 using (5.31) instead of (5.13); again the final sum can be ignored since, using (5.34) and the fact that the $\eta'_{ij}$ are uncorrelated,

$$\mathbb{E}\Bigl(n^{-3/2} \sum_{1 \le i < j \le n} \eta'_{ij}\Bigr)^2 = n^{-3} \binom{n}{2} \frac{1}{12}\Bigl(1 - \frac{1}{m^2}\Bigr) \to 0 \tag{5.36}$$

as $n \to \infty$, cf. (5.15). The summands in $\sum_{i=1}^{n} (n+1-2i) X'_i$ are not identically distributed, but that does not matter since Lyapounov's condition holds. See [8, Corollary 11.20] for a general limit theorem for asymmetric sums like (2.5), and note that the argument in Section 3 is an instance of a general method to convert such sums into (symmetric) U-statistics by introducing the auxiliary variables $Y_i$, see [8, Remark 11.21].

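In the case $m = 2$, one can check that $\eta'_{ij} = -X'_i X'_j$ and thus

$$\sum_{1 \le i < j \le n} \eta'_{ij} = -\sum_{1 \le i < j \le n} X'_i X'_j = \frac{n}{8} - \frac{1}{2}\Bigl(\sum_{i=1}^{n} X'_i\Bigr)^2,$$

which shows that the decomposition (5.31) then is essentially the same as the decomposition used by Takács [16].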



Proof of Theorem 2.4. We use the decomposition (5.31) of $V_{n,m}$, and $N_k = \sum_{i=1}^{n} \mathbf{1}\{X_i = k\}$. The result follows by the central limit theorem with Lyapounov's condition applied to the random vector

$$\Bigl(\frac{\sum_{i=1}^{n} (n+1-2i) X'_i}{n^{3/2}}, \frac{N_1 - \mathbb{E}\, N_1}{n^{1/2}}, \dots, \frac{N_m - \mathbb{E}\, N_m}{n^{1/2}}\Bigr)$$
$$= \sum_{i=1}^{n} \Bigl(\frac{(n+1-2i) X'_i}{n^{3/2}}, \frac{\mathbf{1}\{X_i = 1\} - 1/m}{n^{1/2}}, \dots, \frac{\mathbf{1}\{X_i = m\} - 1/m}{n^{1/2}}\Bigr)$$

together with (5.31) and (5.36); the variances and covariances are easily computed, noting that $\operatorname{Cov}\bigl(\sum_{i=1}^{n} (n+1-2i) X'_i, \sum_{i=1}^{n} \mathbf{1}\{X_i = k\}\bigr) = 0$ for each $k$ since $\sum_{i=1}^{n} (n+1-2i) = 0$. (This vector-valued central limit theorem follows, as is well-known, from the real-valued version [5, Theorem 7.2.2] by the Cramér-Wold device [5, Theorem 5.10.5].) □

6. Proof of Theorem 1.5

To prove the local limit theorem Theorem 1.5 we need estimates of the probability generating function $g_n^{(m)}(q) = m^{-n} G_n^{(m)}(q)$ for $q = e^{i\theta}$ on the unit circle. We derive these estimates from the corresponding estimates of $\binom{n}{n_1,\dots,n_m}_q$ in [3] rather than from scratch. (We do not know whether the estimates below are the best possible.)

Consider a random word $W_{n,m}$ as in Section 2; let again $N_1,\dots,N_m$ be the numbers of occurrences of the different letters, and let $N^* := \max_{k \le m} N_k$ and $N_* := n - N^*$. Similarly, for given $n_1,\dots,n_m$ with $n_1 + \dots + n_m = n$, let $n^* := \max_{k \le m} n_k$ and $n_* := n - n^*$; let further

$$F_{n_1,\dots,n_m}(q) := \binom{n}{n_1,\dots,n_m}_q \Big/ \binom{n}{n_1,\dots,n_m}$$

be the probability generating function of the number of inversions in a random word consisting of $n_1$ 1's, ..., $n_m$ $m$'s, cf. (2.2). Thus $F_{n_1,\dots,n_m}(q)$ is the probability generating function of $V_{n,m} = \mathrm{Inv}(W_{n,m})$ conditioned on $N_k = n_k$, $k = 1,\dots,m$.

Lemma 6.1. There exists $c > 0$ such that for all $m \ge 2$, $n \ge 2$ and real $\theta \in [-\pi, \pi]$,

$$\bigl|g_n^{(m)}(e^{i\theta})\bigr| \le \begin{cases} e^{-c n^3 \theta^2}, & 0 \le |\theta| \le 1/n, \\ e^{-cn}, & 1/n \le |\theta| \le \pi. \end{cases} \tag{6.1}$$

Proof. We assume in the proof for simplicity that $n$ is large enough; this case is enough for our application in Theorem 1.5. It is easy (but not very interesting) to complete the proof by verifying the estimates (6.1) for each fixed $n \ge 2$ and some $c$ (that now might depend on $n$); we omit the details but mention that the case when $m$ is large follows using Theorem 1.2. We let $c_1, c_2, \dots$ denote some positive constants whose values are not important.
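By [3, Lemma 4.1] there exists $\tau \in (0,1)$ such that if $|\theta| \le \tau/n$, then for any $n_1,\dots,n_m$ with $n_1 + \dots + n_m = n$, $|F_{n_1,\dots,n_m}(e^{i\theta})|$ admits a Gaussian-type upper bound of the form $e^{-a\sigma^2\theta^2}$ with an absolute constant $a > 0$, where $\sigma^2$ depends on $n_1,\dots,n_m$ and by [3, Lemma 3.1] $\sigma^2 \ge n^2 n_*/36$. Furthermore, by [3, Lemma 4.4] there exists $c_1 > 0$ such that if $\tau/n \le |\theta| \le \pi$, then

$$\bigl|F_{n_1,\dots,n_m}(e^{i\theta})\bigr| \le e^{-c_1 n_*}.$$

Hence, if $n^* \le 3n/4$ so that $n_* \ge n/4$ we have the estimates

$$\bigl|F_{n_1,\dots,n_m}(e^{i\theta})\bigr| \le e^{-c_2 n^3 \theta^2}, \qquad |\theta| \le \tau/n, \tag{6.2}$$

and

$$\bigl|F_{n_1,\dots,n_m}(e^{i\theta})\bigr| \le e^{-c_3 n}, \qquad \tau/n \le |\theta| \le \pi. \tag{6.3}$$

We return to our string $W_{n,m}$ with random numbers $N_1,\dots,N_m$ of different letters. We can, for any $m \ge 2$, partition $\{1,\dots,m\}$ into three sets with at most $m/2$ elements each, and thus

$$\mathbb{P}(N^* > 3n/4) \le 3\, \mathbb{P}\bigl(\mathrm{Bi}(n, 1/2) > 3n/4\bigr) \le 3 e^{-c_4 n} \tag{6.4}$$

by Chernoff's inequality, see e.g. [9, Theorem 2.1].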
When $|\theta| \le \tau/n$, which implies $n^3 \theta^2 = O(n)$, we obtain by (6.2) and (6.4)

$$\bigl|g_n^{(m)}(e^{i\theta})\bigr| = \bigl|\mathbb{E}\, e^{i\theta V_{n,m}}\bigr|$$
$$= \bigl|\mathbb{E}(e^{i\theta V_{n,m}} \mid N^* \le 3n/4)\, \mathbb{P}(N^* \le 3n/4) + \mathbb{E}(e^{i\theta V_{n,m}} \mid N^* > 3n/4)\, \mathbb{P}(N^* > 3n/4)\bigr|$$
$$\le e^{-c_2 n^3 \theta^2}\, \mathbb{P}(N^* \le 3n/4) + \mathbb{P}(N^* > 3n/4)$$
$$\le e^{-c_2 n^3 \theta^2} + 3 e^{-c_4 n}. \tag{6.5}$$

This verifies (6.1) with some $c > 0$ for $c_6 n^{-3/2} \le |\theta| \le \tau/n$.

For $|\theta| \le c_6 n^{-3/2}$, we first note that $\mathbb{P}(N^* \le 3n/4) \ge c_7 > 0$ for all $m, n \ge 2$; this holds for all large $n$ by (6.4) (and is easily seen for each fixed $n$). Hence, by the calculations in (6.5),

$$1 - \bigl|g_n^{(m)}(e^{i\theta})\bigr| \ge 1 - \mathbb{P}(N^* > 3n/4) - \mathbb{P}(N^* \le 3n/4)\, e^{-c_2 n^3 \theta^2}$$
$$= \mathbb{P}(N^* \le 3n/4)\bigl(1 - e^{-c_2 n^3 \theta^2}\bigr) \ge c_7 c_8 n^3 \theta^2,$$

verifying (6.1) in this case too (for $c \le c_7 c_8$).

Finally, for $\tau/n \le |\theta| \le \pi$, we obtain by arguing as in (6.5), now using (6.3) and (6.4),

$$\bigl|g_n^{(m)}(e^{i\theta})\bigr| \le e^{-c_3 n}\, \mathbb{P}(N^* \le 3n/4) + \mathbb{P}(N^* > 3n/4) \le e^{-c_3 n} + 3 e^{-c_4 n} \le e^{-c_9 n},$$

provided $n$ is large enough. This completes the proof (for large $n$) for the cases $\tau/n \le |\theta| \le 1/n$ and $1/n \le |\theta| \le \pi$. □

Proof of Theorem 1.5. Consider any sequence $m = m(n) \ge 2$. We will show that (1.14) holds uniformly in $k$ for any such sequence $m(n)$; this is equivalent to the asserted uniform convergence for all $m \ge 2$.

Denote the characteristic function of $G_{n,m}$ by $\varphi_n(\theta)$, and recall that it is given by $\varphi_n(\theta) = g_n^{(m)}(e^{i\theta})$, see (1.6). It follows from Theorems 1.3 and 1.4 that

$$\frac{G_{n,m} - \mu_{n,m}}{\sigma_{n,m}} \xrightarrow{d} N(0,1) \tag{6.6}$$

as $n \to \infty$. (To see this we may by considering subsequences assume that $m(n)$ converges to either a finite limit or to $\infty$; then (6.6) is (1.10) or (1.12).) Thus, by the continuity theorem, for any fixed $\theta \in \mathbb{R}$,

$$e^{-i\theta\mu_{n,m}/\sigma_{n,m}}\, \varphi_n(\theta/\sigma_{n,m}) \to e^{-\theta^2/2}. \tag{6.7}$$

Let

$$r_n(\theta) := e^{-i\theta\mu_{n,m}/\sigma_{n,m}}\, \varphi_n(\theta/\sigma_{n,m})\, \mathbf{1}\{|\theta| \le \pi\sigma_{n,m}\} - e^{-\theta^2/2}, \tag{6.8}$$
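and note that $r_n(\theta) \to 0$ as $n \to \infty$ for each fixed $\theta$ by (6.7), since $\sigma_{n,m} \to \infty$ by (1.8).

By Fourier inversion we have

$$\sigma_{n,m}\, \mathbb{P}(G_{n,m} = k) = \frac{\sigma_{n,m}}{2\pi} \int_{-\pi}^{\pi} e^{-ikt} \varphi_n(t)\, dt = \frac{1}{2\pi} \int_{-\pi\sigma_{n,m}}^{\pi\sigma_{n,m}} e^{-ik\theta/\sigma_{n,m}}\, \varphi_n(\theta/\sigma_{n,m})\, d\theta$$
$$= \frac{1}{2\pi} \int_{-\infty}^{\infty} e^{i(\mu_{n,m} - k)\theta/\sigma_{n,m}}\, r_n(\theta)\, d\theta + \frac{1}{\sqrt{2\pi}}\, e^{-(\mu_{n,m} - k)^2/(2\sigma^2_{n,m})},$$

and thus, for all $k \in \mathbb{Z}$,

$$\Bigl|\sigma_{n,m}\, \mathbb{P}(G_{n,m} = k) - \frac{1}{\sqrt{2\pi}}\, e^{-(k - \mu_{n,m})^2/(2\sigma^2_{n,m})}\Bigr| \le \frac{1}{2\pi} \int_{-\infty}^{\infty} |r_n(\theta)|\, d\theta.$$

The result (1.14) follows since

$$\int_{-\infty}^{\infty} |r_n(\theta)|\, d\theta \to 0$$

as $n \to \infty$ by dominated convergence, using Lemma 6.1; note that if $|\theta| \le \pi\sigma_{n,m}$, then $|\theta| \le n^{3/2}$ since $\pi^2 \sigma^2_{n,m} \le n^3$ by (1.8), and hence (6.1) yields

$$\bigl|\varphi_n(\theta/\sigma_{n,m})\bigr| = \bigl|g_n^{(m)}(e^{i\theta/\sigma_{n,m}})\bigr| \le e^{-c n^3 \theta^2/\sigma^2_{n,m}} + e^{-cn} \le e^{-c\theta^2} + e^{-c|\theta|^{2/3}};$$

hence, for all $n \ge 2$ and $\theta \in \mathbb{R}$,

$$|r_n(\theta)| \le 2 e^{-c\theta^2} + e^{-c|\theta|^{2/3}}.$$

The version with $\tilde\mu_{n,m}$ and $\tilde\sigma^2_{n,m}$ follows in exactly the same way, starting with

$$\frac{G_{n,m} - \tilde\mu_{n,m}}{\tilde\sigma_{n,m}} \xrightarrow{d} N(0,1), \tag{6.9}$$

which is equivalent to (6.6) since $\tilde\sigma^2_{n,m} \sim \sigma^2_{n,m}$ and $\tilde\mu_{n,m} = \mu_{n,m} + o(\sigma_{n,m})$ as $n \to \infty$ by Theorem 1.1. □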

References

[1] G. E. Andrews, The Theory of Partitions, Addison-Wesley, Reading, Mass., 1976.
[2] T. Bliem and S. Kousidis, The number of flags in finite vector spaces: asymptotic normality and Mahonian statistics. Preprint, 2011. arXiv:1109.4624.
[3] E. R. Canfield, S. Janson and D. Zeilberger, The Mahonian probability distribution on words is asymptotically normal. Adv. Appl. Math. 46 (2011), no. 1-4, 109-124. Erratum: http://www2.math.uu.se/~svante/papers/sj239-erratum.pdf
[4] J. Goldman and G.-C. Rota, The number of subspaces of a vector space. Recent Progress in Combinatorics (Proc. Third Waterloo Conf. on Combinatorics, 1968), Academic Press, New York, 1969, pp. 75-83.
[5] A. Gut, Probability: A Graduate Course, Springer, New York, 2005. Corrected 2nd printing 2007.
[6] S. Hitzemann and W. Hochstättler, On the combinatorics of Galois numbers. Discrete Math. 310 (2010), no. 24, 3551-3557.
[7] W. Hoeffding, A class of statistics with asymptotically normal distribution. Ann. Math. Statistics 19 (1948), 293-325.
[8] S. Janson, Gaussian Hilbert Spaces. Cambridge Univ. Press, Cambridge, 1997.
[9] S. Janson, T. Łuczak and A. Ruciński, Random Graphs. Wiley, New York, 2000.
[10] V. Kac and P. Cheung, Quantum Calculus, Springer, New York, 2002.
[11] A. Nijenhuis, A. E. Solow and H. S. Wilf, Bijective methods in the theory of finite vector spaces. J. Combin. Theory Ser. A 37 (1984), no. 1, 80-84.
[12] G. Pólya, Gaussian binomial coefficients and the enumeration of inversions. Proc. Second Chapel Hill Conf. on Combinatorial Mathematics and its Applications (Univ. North Carolina, Chapel Hill, N.C., 1970), pp. 381-384, Univ. North Carolina, Chapel Hill, N.C., 1970.
[13] L. Rogers, On a three-fold symmetry in the elements of Heine's series. Proc. Lond. Math. Soc. 24 (1893), 171-179.
[14] U. Schwerdtfeger, Volume laws for boxed plane partitions and area laws for Ferrers diagrams. Proceedings, Fifth Colloquium on Mathematics and Computer Science (Nancy, 2008), Discrete Math. Theor. Comput. Sci. Proc. AI, 531-539.
[15] R. P. Stanley, Enumerative Combinatorics, Volume I. Cambridge Univ. Press, Cambridge, 1997.
[16] L. Takács, Some asymptotic formulas for lattice paths. J. Statist. Plann. Inference 14 (1986), no. 1, 123-142.
[17] C. R. Vinroot, Multivariate Rogers-Szegő polynomials and flags in finite vector spaces. Preprint, 2010. arXiv:1011.0984.

Department of Mathematics, Uppsala University, PO Box 480, SE-751 06 Uppsala, Sweden

E-mail address: svante.janson@math.uu.se
URL: http://www.math.uu.se/~svante/



